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ABSTRACT 



f^S'7 ? f 
^ • / 

The ability to predict Marine officer attrition 

continues to be a significant obstacle to accurate manpower 
planning at Headquarters Marine Corps. This paper expands 
on a previous study which used unemployment in a linear 
regression model to predict attrition. Through the use of a 
binary choice model, this study incorporates a number of 
additional variables, principal of which is a performance 
index score to measure promotion potential. The logit 
function was employed to evaluate the inherent causal 
relationships and to further define the ability of 
statistical analysis techniques to predict officer 



attri tion. 
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uses a running average method to forecast attrition. This 
method uses the average annual attrition for the last six 
years adjusted by the intuitive assumptions of the officer 
responsible for determining the officer loss rate. This 
method is reliable only to a certain degree, with some years 
proving more accurate than others. The "off years" create 
significant complications for the efficient management of 
manpower planning. Because of this problem, HQMC requested 
a statistical study be undertaken to improve their predict- 
ing capabilities. The request was referred to the Naval 
Postgraduate School, and Major W. J. ESMANN, USMC, conducted 
an analysis which resulted in his thesis entitled, "Marine 
Officer Attrition Model". 

Major Esmann's initial research seemed to indicate a 
dramatic improvement in the ability to predict total attri- 
tion, but he concluded his thesis with a recommendation that 
further research be conducted using additional variables in 
a binary choice model. 

The first problem encountered by Major Esmann was to 
narrow the choice of variables which would be used to pre- 
dict attrition to a workable number. The initial choice of 
variables was made through interviews and the intuitive 
decisions of the officer conducting the analysis. At first, 
three categories of variables were identified as factors 
which could determine whether an officer would stay on 
active duty or leave the Marine Corps. These variables were 
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military pay, the economy (specifically unemployment), and 
promotion potential. Of these three variables, promotion 
potential was immediately eliminated because a method to 
measure that variable had not yet been developed. However, 
it is significant to note that throughout the study, those 
officers interviewed felt promotion potential would be a 
major factor in determining officer attrition. 

Next, military pay was studied using factors which 
showed its comparability with the private sector. A linear 
regression model was used to provide an insight to their 
predictive value. The results of this analysis indicated 
that comparability between military and civilian pay was not 
a significant factor in an officer's decision to leave the 
Marine Corps. These results contradicted a study conducted 
by the Center for Naval Analysis (CNA) which showed that pay 
comparability was a factor in an officer's decision to 
remain on active duty. However, the CNA study included all 
branches of the Armed Forces while Major Esmann's model 
considered Marine officers alone. The conclusion was that a 
Marine officer was a different breed of serviceman, not 
motivated by pay when making his decision to leave the 
Marine Corps [Ref. 1]. 

Once pay comparability and promotion potential were 
eliminated, only the economy remained as a primary factor. 
To determine which aspects were significant, the possible 
economic impact on attrition was broken into three areas: 
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unemployment, consumer price index (CPI), and the Gross 
National Product (GNP) . Using this economic data, initial 
tests were again run using a linear regression model with 
both the CPI and GNP proving insignificant as factors in 
determining officer attrition. 

Finally, unemployment was evaluated and segmented 
into professional-technical and managerial unemployment. 
These factors were assigned to groups within the data base, 
aviation officers being classified as professional-technical 
and ground officers as managerial. Again, a linear regres- 
sion model was used to determine the significance of this 
var iable . 

The data base used for Major Esmann's study was 
limited for several reasons. First, the data included in- 
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2 . Application and results 

As previously mentioned. Major Esmann's model was 
formulated to assist planners in two ways. First, it was 
designed to predict future manpower levels, an annual 
process whereby officer attrition determines accession 
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requirements and, therefore, recruiting goals. From these 
figures, the manpower budget is more accurately prepared and 
the budget call improved. Second, with a more accurate 
prediction of attrition, realistic eligibility zones can be 
determined, thus eliminating the mismatch between the number 
of vacancies and the number of officers selected for 
promotion . 

The results of the model, using unemployment as the 
key variable, produced significant improvements over the old 
averaging method. Again, the averaging method took the sum 
of the last six years attrition, by rank, and divided it by 
six. The value derived was then modified depending on the 
intuition of responsible officer. TABLE 1 shows the results 
of the averaging method. Major Esmann's model, the actual 
figures for FY83, and a prediction (using his model) for 
FY84. As depicted in TABLE I, a clear improvement can be 
seen over the averaging method. 

Major Esmann's study concluded that unemployment is 
a significant factor in predicting officer attrition. 
However, his concluding recommendation centered around add- 
ing a promotion potential variable to the unemployment 
statistics in order to improve the model's forecasting 
ability [Ref. 2] . 
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TABLE 1 



COMPARISON OF MODEL RESULTS 



Rank 


Avg 

Method 

1983 


Reg ress 
Model 
1983 


Actual 

1983 


% Error 
Avg 
1983 


% Error 
Reg 
1983 


Regress 

Model 

1984 


LtCol 


177 


152 


127 


39.4 


19.7 


152 


Major 


186 


149 


146 


27.4 


2.1 


164 


Capt 


452 


377 


392 


15.3 


-3.8 


407 


IstLt 


525 


416 


396 


32.6 


5.1 


425 



C. PLANNED IMPROVEMENTS 

With this initial background, the authors set out to 
expand the previous study and provide HQMC with an improved 
method for predicting officer attrition. The problem of 
establishing a variable that could measure promotion poten- 
tial, one of the key factors missing from Major Esmann's 
model, was solved using fitness report data provided by 
HQMC. An explanation of this process is covered in Chapter 
II of this thesis. 

Next, a number of interviews were conducted with offic- 
ers at HQMC responsible for predicting officer attrition. 
These interviews provided insight into the effectiveness of 
previous attempts to use Major Esmann's model. In essence, 
HQMC had achieved negative results using the Marine Officer 
Attrition Model during FY1984 and had reverted to the 
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averaging method which was proving more accurate. HQMC felt 
that although 1984 was not a typical year in terms of 
attrition, the averaging method provided a more accurate 
prediction than the Esmann model. These discussions gave 
some initial concerns but the authors believed the use of 
the performance index would prove to be critical factor in 
improving attrition prediction. 

In the subsequent chapters, the authors will describe 
the basic data as supplied by HQMC and the transformations 
required to create the working data base for this study; a 
description of the statistical functions available for this 
type of research, their strengths and weaknesses, and the 
eventual selection of the model that was used; the methodol- 
ogy employed during the data manipulation phase; and an 
analysis of the results with concluding comments. 
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DATA BASE 



II . 



A. SOURCE 



The data for this thesis were obtained from the Marine 
Corps Manpower Management System (MMS) and fitness report 
input. It consisted of 132,903 records containing nineteen 
data elements on every Marine officers on active duty from 
1977 through 1984. A record was created for each officer 
for each year, so that of the 132,903 records, somewhat less 
were actually in the data base due to multiple entries for 
those officers that remained on active duty during the 
period covered. A detailed example of the data supplied by 
HQMC is contained in Figure 2.1. The variables included 
wer e : 



(1) ID CODE. This is an encoded identification of the 
officer whose data appears on this line. 

(2) PMOS . This numeric discriptor indicates an offic- 
er's Primary Military Occupational Specialty. 

(3) FSTAMOS. This is the First Additional Military Occu- 
pational Speciality, or secondary skill the officer 
is qualified for. 

(4) ED MA J . This a numeric designator describing the 

officer's civilian education major. 

(5) RANK. This alphanumeric variable represents the pay 
grade equivalent of the officer's rank. 

(6) ADBDY , ADBDM, ADBDD. Combined, these variables 
indicate the Active Duty Base Date. This date is 
often not the actual date an officer first came on 
active duty, but has been adjusted for time spent 
in the reserves or as an enlisted person. 
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(7) FCOMMY, FCOMMM, FCOMMD. This is the date the offic- 
er was first commissioned. 

(8) SCORE. This is the officer's Performance Index 
Score, to be described later in this chapter. 

(9) POS SCR. This is the maximum, or Possible Score, 
that an officer could have attained for his 
length of service. 

(10) DATESEPY, DATESEPM, DATESEPD. This is the date 
the officer was separated from active service. 

A in these columns means the officer is still 

on active duty. 

(11) CYC YR. This is Manpower Management System date 
indicating the end of the record year this data 
was compiled from. 

The first change made to the basic data supplied was to 
eliminate those categories of officers whose presence could 
bias the results or that would have negligible impact on the 
outcome. To this end, all second lieutenants were elimin- 
ated because they would be promoted prior to reaching a 
decision point in their career. Those officers with more 
than four years of enlisted service, warrant officers, and 
limited duty officers were also eliminated because a prelim- 
inary statistical analysis indicated they tended to be ca- 
reer oriented with virtually none leaving the service prior 
to their twentieth year. The number of officers, by rank, 
remaining in the data base are displayed in Table II. 

In addition to the personal data provided through the 
MMS, data pertinent to an individual's decision to remain on 
active duty were obtained from a number of other sources. 
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Figure 2.1 



Sample of 



the Basic Data. 
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TABLE II 



RANK REPRESENTATION IN DATA BASE 



Colonel 



4,072 



Lieutenant Colonel 



10,327 



Major 



20,532 



Capta i n 



26,750 



First Lieutenant 



26,250 



TOTAL 



87,931 



First, from the officer's fitness reports, HQMC provided 
a performance SCORE which was determined by weighting each 
of the "Performance” and "Qualities" blocks (see Figure 2.2) 
of the report and taking this value times a numeric value 
assigned to the mark received (Outstanding, Excellent, 
etc.). The maximum score assigned, or POS_SCR, is the score 
that would have resulted had the officer received a mark of 
"Outstanding" in each of the observed areas. Both SCORE and 
POS_SCR are cumulative, with senior officers having a higher 
value for these variables than a more junior officer. 

Second, a relationship between military and civilian 
pay, expressed as a percentage difference of the former from 
the latter, was obtained from the government's Employment 
Cost Index [Ref. 3] and added to the data base for each year 
of the period covered. The figures used in our analysis are 
displayed in Table III. 
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TABLE III 



1 

1 


MILITARY PAY 


COMPARABILITY 


1 

1 


1 

1 


Year 




Difference 


1 

1 

1 


1 

1 

1 


1977 




CO 

• 

1 


1 

1 

1 


1 

1 


1978 




-4.3 


1 

1 

1 


1 

1 


1979 




-6.5 


1 

1 

1 


1 

1 

1 


1980 




-7.3 


I 

1 

1 


1 

1 


1981 




1 

• 

00 


1 

1 

1 


1 

1 

1 


1982 




-0.4 


1 

1 

1 


1 

1 

1 


1983 




-4.2 


1 

1 

1 


Last 


, managerial 


and technical executive unemployment 


figures 


were obtained 


from the Bureau of Labor and 


Statis- 


tics. 


These figures 


[Ref. 


4] , which were lagged 


by six 


months 


to bring them 


in line 


with a date closer to 


the time 



an officer made the decision to resign, are displayed in 
TABLE IV. 

B. TRANSFORMATIONS 



Although some of the data could be used directly, many 
of the required elements needed to be derived from the basic 
data as supplied. 

An individual's performance INDEX was determined by 
dividing his SCORE by his POS_SCR. Due to the inflation of 
fitness reports, the INDEX covered a very narrow range. 
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TABLE IV 



1 

1 


LAGGED UNEMPLOYMENT RATES | 


1 

1 Year 

1 


1 

Managerial Tech. Executive | 


1 1977 


.0295 .0310 1 


1 

1 1978 


1 

.0245 .0280 1 


1 

1 1979 

1 


1 

.0210 .0250 1 

1 


1 

1 1980 

1 


1 

.0225 .0245 1 

1 


1 

1 1981 

1 


1 

.0255 .0265 1 

1 


1 

1 1982 

1 


1 

.0315 .0305 1 

1 


1 

1 1983 


1 

.0355 .0320 1 



The inflation of fitness reports during the period 
also required the INDEX be normalized to a common mean. This 
was accomplished using the following formula; 

INDEXn = (INDEX - INDEXrm) / Std Dev 



where : 




INDEXn 


= Normalized Index 


INDEX 


= given 


INDEXrm 


= Index mean by rank and years of service 


Std Dev 


= Standard Deviation corresponding to the 

INDEXrm 



To determine if an individual's education major played a 
significant role in is decision to remain on active duty, 
the Marine Cor. Manpow.r Management System(MMS) 
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alphanumeric code for education major was converted to a 
binary (1 and 2) choice that could be used with the LOGIST 
procedure of SAS. This was accomplished in a somewhat 
subjective manner based on the marketability of the 
individual's level and type of degree. As an example, those 
individuals possessing degrees in the soft skills, such as 
business administration and education were given an EDSPEC 
of 1 while those with more technical degrees such as 
computer science or electrical engineering were given an 
EDSPEC of 2. 

The variable OUT was derived from the DATESEPY element 
of the basic data. If an individual had a entry for this 
element, it indicated he had been released from active duty 
and was assigned an OUT variable value of 1. An individual 
remaining on active duty received a value of 0. 

YEARS, or the number of years each officer had been on 
active duty, was simply his active duty base date (ADBDY ) 
minus the data cycle year (CYC_YR) , a date used by the MMS 
to mark the annual update of individual's record. 

Because colonels are assigned a 99XX series PMOS, or 
numerical designator for an officer's primary military occu- 
pational specialty, upon promotion, it was necessary to 
create a new variable entitled MOS that would correlate the 
actions of these officers with those of lesser grades. For 
lieutenant colonel and below, MOS equalled PMOS, but for 
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colonels, MOS equalled their first additional MOS (FSTAMOS), 
or the MOS they had been assigned prior to promotion. 

Finally, a determination was required to accurately pre- 
sent the unemployment statistics, or the UNEMP variable. 
Like Major Esmann, the authors assigned Naval Aviators and 
Naval Flight Officers the unemploment figures associated 
with technical executives while the other officers were 
assigned the managerial statistics. This was clearly an 
subjective de ision, but one which appeared to most correct- 
ly reflect the employability of the population. 

An example of the revised data appears in Figure 2.3. 
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III. MODEL SELECTION 



A. GENERAL 

To assist in the analysis and prediction of Marine 
officer attrition, it was necessary to select a computer- 
assisted model that would most appropriately handle the 
unique aspects of this particular study. Previous studies 
of the subject had used multivariate regression analysis. 
Although this procedure was somewhat successful, it was 
apparent that other mo'jels were available that might prove 
more adept at accurately predicting an officer's decision to 
remain in the Marine Corps or leave the service. Therefore, 
following the collection and refinement of the data base, a 
thorough analysis of the available models was conducted to 
determine the algorithm that would most accurately predict 
the binary "stay / leave" decision. 

1 . Linear Multiple Regression 

Linear multiple regression analysis is probably the 
most commonly used tool of the statistician who wishes to 
develop a model that will predict a dependent output from a 
series of known, independent inputs. This form of regres- 
sion analysis is concerned with modeling the relationship 
among the independent variables. It quantifies how a 
response, the dependent variable, is related to a set of 
explanatory, independent variables. If the true relation- 
ship among the variables were known precisely, the 



26 



statistician would be able to accurately predict the 

outcome. However, the true relationship is rarely known and 
approximations must be developed from empirical evidence. 

To accomplish this approximation, the concept of 
least squares is used to fit the regression equation to the 
observed data. The least squares method requires that a 

line be chosen to fit the data so that the sum of the 
squares of the vertical deviations separating the points 
from the line will be a minimum. These deviations are 
represented by the lengths of the vertical line segments 
that connect the points to the estimated regression line. 
In the case of multiple regression, the line is replaced by 
a hyperplane with dimensions equal to the number of 

independent variables. The resulting estimated multiple 
regression equation would be; 

Y = a + b, X, + b^ Xj . . . + b„ X„ + e 
where : 

Y = The value of Y calculated from the 

regression equation. 

a = The value of the constant or "y- 

intercept " . 

b„ = The estimated regression coefficient of 

a particular independent variable. 

X„= An independent variable. 

e = Residual error associated with the model 
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As can be seen, the major goal of linear multiple 
regression is to predict Y from the regression plane for 
given levels of X. This is an excellent tool, with some 
reservations, if the independent variables are continuous 
and randomly distributed (weight, speed, or payload) and 
the independent variable will also be continuous, such as 
cost [Ref. 5]. But, if the investigator wishes to predict 
the either/or behavior of an individual or group based on 
the observed past performance of a similar group, then the 
traditional forms of regression analysis are not able to 
accurately predict their response or action because the 
independent variable is qualitative, or at least observed to 
be qualitative, and not quantitative. Voting, the decision 
to marry, have children, or go to college are examples of 
qualitative decisions that are not adequately handled by 
linear multiple regression. What is needed, therefore, is a 
set of statistical techniques that can do the work of multi- 
ple regression but with qualitative choices as the indepen- 
dent variables. Following will be a discussion of two of the 
more commonly used procedures to accomplish this task. 

2 . Discriminant Analysis 

Discriminant Analysis deals with a specific class of 
statistical problem that focuses on the study of groups of 
populations or data sets. In general it is assumed that the 
groups being analyzed are discrete and identifiable, that 
each observation in the group can be described by a set of 
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measurable variables, and that the variables have a normal 
distribution within each group. With this in mind, the 
purpose of discriminant analysis is to test for mean group 
differences and to describe the overlaps among the groups 
(the decision to leave the Marine Corps for instance, is a 
different one than the decision to stay so that the 
probabilities may add to more than one due to this overlap). 
Furthermore, discriminant analysis will construct a classi- 
fication scheme based on the variable set and thereby create 
a predictive model with which to determine the probable 
actions of similar groups. 

To use discriminant analysis as a predictive tool, a 
concept similar to regression analysis called the 
discriminant function is employed. However, unlike the 



linear regression problem. 


the 


dependent 


variable 


is 


dichotomous. Th 


is function 


uses 


a weighted 


combination 


of 


the independent 


variables to 


classify a res 


ponse into 


a 


variable group. 


Furthermore , 


using a weighted 


sum of val 


ues 


derived from 


the individual 


predictor 


variables. 


a 


discriminant sco 


re is calculated. 


A cutoff 


score is then 



determined that, used in conjunction with the discriminant 
score, assigns those observations with discriminant scores 
higher than the cutoff score to one of the criterion groups 
while those observations with discriminant scores less than 
the cutoff are assigned to the other [Ref. 6]. The 
discriminant function is expressed as follows: 
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L = b, X, + . . . + b„X„ 

where : 

L = Discriminant score 

b„ = Weights associated with the predictors 

X„= Values of the predictor variables 

The default cutoff score in most automated systems, such as 
SAS or SPSS, is .5 with the user being able to modify the 
breakpoint, if desired, to weight the outcome. This would 
be done to reduce the number of misclassif ications in a 
particular direction. 

In general, discriminant analysis allows the 
statistician to classify objects into groups based on their 
values as predictor variables and, much like multiple 
regression, creates a model that can predict individual or 
group behavior based on the actions of a similar population 
base from the past. One of the key factors in accurately 
predicting behavior, however, is that there be a large 
difference between the criterion groups. This was certainly 
not the case with one of the primary variables used in the 
study of Marine officer prediction, namely the performance 
INDEX. Due to the inflation of the fitness report marks 

during the period covered by the data base, it was found 
that the vast majority of the INDEX values fell into a very 
narrow range. Thus the difference between an outstanding 
officer and a less capable one was numerically very small. 
This led to classification errors and therefore less than 
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acceptable results when trying to determine what percentage 
of a particular group would leave the Marine Corps. 

3 . Logit 

Logit is a binary-choice model that assumes that 
individuals faced with a choice between two alternatives 
will make a decision based on their respective characteris- 
tics. Unlike discriminant analysis, which postulates that 
the observed values of individual characteristics and 
attributes of alternatives are drawings from posterior dis- 
tributions conditioned on actual responses, logit proposes 
that the actual responses are drawings from multinomial 
distributions with selection probabilities conditioned on 
the observed values of individual characteristics and attri- 
butes of alternatives. With this in mind, it can be seen 
that the logit model can be used to calculate the probabili- 
ty of an individual making a particular choice, such as to 
get out of the Marine Corps, conditioned on an observation 
of some specific elements of information about that particu- 
lar officer. To accomplish that process, the function takes 
the general form; 

Pj 

Z] = Log ( ) = b, X,j + bjXjj + ... b„ Xjj + e 

1 - Pj 

where ; 

Zj = Probability that a particular decision will be 
made given a particular variable grouping 

Pj = Probability that an individual will make a 
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Z, = Male, high 
Zj, = Male, medium 
Zj = Male, low 
Z^ = Female, high 
Zg = Female, medium 
Z^ = Female, low 

Once these probabilities have been calculated in conjunction 
with a particular choice, for instance to get out, the model 
is run against the population to be tested and the individ- 
ual probabilities summed to determine the number predicted 
to leave the Marine Corps. For example, using the following 
test data. 



TEST DATA 

Obs Sex Income Z 

1 M H .34 

2 F M .69 

3 F H .52 

4 M L .23 

5 M L .23 

6 M M .38 

2.39 

the logit function predicted that 2.39 officers would be 
leaving the Marine Corps. 

Since the logit function makes far fewer assumptions 
than the discriminant model, such as no multivariate 
normality for covariates, this form of regression analysis 
is often preferred to discriminant analysis. In addition, 
the theory that an officer would leave the Marine Corps 
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because of pay di irences, his educational background, or 
perceived probab* ity of promotion suggests a causal rela- 
tionship between the explanatory variables and the probabil- 
ity of his separation. Here again, in side-by-side 
comparisons, the logit function has proved to be somewhat 
more accurate in forecasting the direction the dependent 
variable will take [Ref. 7]. 

B. MODEL SELECTION 



Based on the foregoing analysis, and the availability of 
the appropriate models at the Naval Postgraduate School, the 
authors elected to use the logit function to conduct this 
study. As implemented by the Statistical Analysis System 
(SAS), the logit function is entitled LOGIST. Supporting 
the basic program are a number of standard statistical tools 
to assist the user in the stepwise elimination of i.nsignifi- 
cant variables. As a guide to the reader in evaluating the 
SAS LOGiST output, a sample printout is included in Figures 
3.1 - 3.4, with the critical points explained below. 



1. This matrix depicts the percentile distribution of a 
particular group based on their individual 
performance INDEX. This matrix, although not part 
of the LOGIST procedure, was included to assist in 
the analysis by displaying the number of officers 
falling into each of the ten categories. This 
breakout by INDEX gives the investigator a detailed 
view of why the model is, or is not, performing as 
expected. In this example, there is a wide differ- 
ence between those staying in (the column headed 
"y") and those leaving (the column headed "1"). 
Although this difference slowly fades as the INDEX 
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scores increase, this model should perform very well 
in predicting officer attrition because of the dis- 
tinct classification process that has occured. 

2. This is the number of observations in the group con- 
sidered, less those with missing data. 

3. Displayed in this area is the basic statistical data 
on the independent variables. 

4. The P-value for testing the joint association of 
all variables in the model with the dependent 
variable is displayed at this point. In this 
example, the P-value of 0 indicates the model is 
totally significant. 

5. These are the P-values of the individual variables 
deleted from the model as insignificant. 

6. Displayed here are the estimated regression coeffi- 
cients for each variable in the model. 

7. This column contains the estimated standard errors 
of the regression coefficient estimates. 

8. This column contains the P-value of the variables 
retained by the model as being significant. 

9. This column depicts the lower confidence limit of 
the predicted probability. 

10. Displayed in this column is the predicted probabil- 
ity of this particular officer leaving the Marine 
Corps given the values of the independent vari- 
ables associated with this observation. 

11. This column contains the upper confidence limit 
of the predicted probability. 

12. This is the summation of the predicted probabilities 
for this group, which equals the total number of 
officers the model predicts will be getting out of 
the Marine Corps [Ref. 8]. 

This concludes a description of the model selection 
procedures. In Chapter IV, the authors will analyze the 
output resulting from the application of the logit model to 
each of the groups studied. 
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Figure 3.1 Sample SAS Output 
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Figure 3.2 Sample SAS Output 
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Figure 3.3 Sample SAS Output 
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IV. ANALYSIS 



A. GENERAL 

The initial step of the a-ialysis was to perform a 
preliminary study of the basic data. Through this review, 
the authors selected and, if necessary, transformed the 
independent variables which they felt would prove useful in 
predicting attrition. Along with this procedure, the 
dependent variables were transformed from the basic data to 
reflect a binary choice decision. With this accomplished, 
the basic data were purged to eliminate biased information. 
Through these procedures, a data base was created from which 
eight population groups were selected and evaluated using 
the SAS LOGIST procedure. 

B. INDEPENDENT VARIABLES 

As indicated in Chapter III, manipulation of the data 
was required to create a file which would contain the 
significant factors to conduct the analysis. Once the data 
base was transformed into a workable file, the independent 
variables to be used in the study were selected on the basis 
of intuitive decisions by the authors. 

The first variable selected was INDEX. This variable 
was the primary reason for conducting this follow-on 
analysis of officer attrition. However, the binary choice 
model enabled the authors to add additional independent 
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indication of his career potential in the Marine Corps. 
Those officers with a PMOS of 0302 (infantry) and 0802 
(artillery) would seem to be taking the "ideal" career path 
of a Marine ground officer. Both authors have observed that 
officers without the above specialties did perceive 
themselves to be less promotable and therefore more likely 
to attrit when the opportunity arose. 

C. DEPENDENT VARIABLE 

The dependent variable for this model was entitled 
"OUT". This variable positioned a 1 in every record which 
had a separation date and a 0 in those records for which 
there was no separation date. 

D. SCOPE OF STUDY 

1 . General 

In order to remove data that would bias the outcome 
and to limit the scope of the study to a reasonable number 
of population groups, two decisions were required prior to 
the application of the logit function to the data. First, 
it was determined that a large portion of the population, 
for which the decision to attrit was either insignificant or 
not a factor at all, would have to be eliminated from the 
data base. Second, once this group was deleted from the 
data, a decision was required to determine which of the many 
possible rank by years-of-service population groups could be 
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analyzed in the time allotted and still provide a accurate 
projection of officer attrition. 

2 . Biased Data Elimination 

In keeping with the Esmann study, the authors 
eliminated all ground second lieutenants from the analysis 
because their initial contracts would require them to remain 
on active duty until after they would have been promoted to 
first lieutenant. By the same criterion, aviators were also 
not evaluated until they had reached the rank of captain. 

Additionally, the authors eliminated all limited 
duty officers and warrant officers. As explained in Chapter 
II, these officers have normally amassed tenure from prior 
enlisted service and their committment to remain on active 
duty is strengthened by their selection for promotion to 
officer status. 

3 . Model Population Groups 

The initial intention was to run the independent 
variables against the dependent variable "OUT" by rank. 
However, further study clearly showed that to run this model 
against a population categorized by rank only would give a 
very biased result. A captain with 6 years of service faced 
with the decision to leave the Marine Corps is influenced 
by different factors than a captain with 9 years of service. 
As discussed in Chapter II, the variable YEARS was created 
from the basic data by subtracting an individual's active 
duty base date (ADBDY) from the data cycle year (CYC YR) . 
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Through this transformation, the variable YEARS provided the 
authors with the ability to further separate the RANK 
populations by the number of years an officer had been on 
active duty. This procedure greatly expanded the number of 
population groups which could be evaluated. To reduce the 
complexity of this problem, a program was written to show 
the frequency of attrition by year and rank. These data 
proved useful in evaluating the final results, as well as 
indicating which years by rank were the critical years for 
attrition. 

The authors, using the results of this analysis, 
determined that a satisfactory evaluation of Marine officer 
attrition could be accomplished by studying five years by 
rank groupings. The significant year is that point in an 
officer's career when the greatest number of his peers leave 
the Marine Corps. As shown in Table V, the significant 
years for each rank were: First lieutenants, year 3; 
captains, year 5; majors, year 11; lieutenant colonels, year 
20; colonels, year 26. Also depicted in Table V are the 
actual attrition figures for FY 81, the year used to 
establish the validity of the LOGIST model. 

Once this decision was reached, the only remaining 
factor was to determine whether ground and aviation officers 
should be separated and tested independently. The decision 
was, like the Esmann model, to test these groups 
independently. The authors realized that the aviation and 
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TABLE VI 



GROUP/VARIABLE 


CORRELATION 


Data Group 


Variables Used 


1st Lt (Gnd) 


INDEX, PAY, UNEMP(Mgr) 
MOS, EDSPEC 


Capt (Gnd) 


same as 1st Lt (Gnd) 


Capt (Air) 


INDEX, PAY, UNEMP(Tech) 
EDSPEC 


Maj (Gnd) 


same as Capt (Gnd) 


Maj (Air) 


same as Capt (Air) 


Lt Col (Gnd) 


same as Capt (Gnd) 


Lt Col (Air) 


same as Capt (Air) 


Col 


INDEX, PAY, UNEMP(Mgr) 



EDSPEC 



(Infantry and Artillery) and the remainder of the 
population. For colonels, the authors eliminated any MOS 
distinction because these officers have made a critical 
transition to an executive level where MOS looses its 
significance. Because the aviation population groups were 
tested independent of the ground officers, MOS was not a 
factor and therefore eliminated. 

E. OUTPUT 

1 . General 

Once the data groups were established, the authors, 
using the LOGIST procedure, analyzed the data groups in two 
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areas. First, the authors fit the models to the FY77 
through FY80 data using the LOGIST function. This was 
accomplished by the LOGIST procedure, using the stepwise 
mode, which provided statistical data on those variables 
remaining in the model as well as those deleted because they 
failed at the .05 level. The estimated models resulting 
from this procedure provided the output that was then used 
to produce a probability, or P-value, for each individual to 
be tested in the FY81 and FY82 forecast years. The 
summation of these individual probabilities gave the 
predicted attrition for the population group being analyzed. 
Comparing the predicted number with the actual figures 
provided by HQMC, the authors were able to calculate a 
prediction rate for each population group. For consistency, 
this prediction rate was calculated as an absolute value , 
always using the lower of the two numbers (actual or 
predicted) as the numerator. 

Because the primary purpose of this analysis was to 
evaluate the predictive capabilities of the logit model of 
Marine officer attrition, prediction rates are presented 
first. Then the significance of the variables in the 
estimated models are discussed. To assist the reader, the 
authors have included in the discussion that follows for 
each population group, a table ranking the variables in 
order of significance by P-value of their associated Chi 
Square tests. In addition, using a procedure unrelated to 
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LOGIST, interval tables displaying the performance index by 
percentile were addressed. 

2 . First Lieutenants (Ground) 

a. Prediction Rates 

Prediction rates for this group showed varying 
results. For FY81 actual attrition for ground officers in 
this data group was 183. LOGIST predicted a total of 186 
officers would attrit during FY81 for a prediction rate of 
98 percent. Encouraged by these results the authors decided 
to evaluate FY82 for a more c'^nclusive study. For FY82 
the results of the model were far less accurate. Actual 
attrition for FY82 was 169 while LOGIST provided a forecast 
of 140. These figures equated to an 83 percent prediction 
rate or far below the rate achieved with the FY81 model. 
Figure 4.1 displays 43 observations from the FY81 data 
group, along with the predicted probability (column P) of 
each officer leaving the Marine Corps. Below the 
observations, in the "SUM" column, is a summation of the 
predicted probabilities, or the total number LOGIST predicts 
will leave the Marine Corps in FY81 and FY82 

b. Significance of Variables 

All variables within this population group 
proved significant at the .05 level. Performance INDEX 
showed the lowest P-value indicating its high level of 
significance to the model while PAY was the least 

significant with a P-value closest to .05. Table VII 
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First Lieutenants (Ground) 
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each variable and 



shows, by their order of significance, 
their corresponding P-value and the Chi Square statistic. 

Figure 4.2 shows the final parameter estimates 
of each variable with its associated P - statistic and 
regression coefficients of each of the significant 
variables. These results would seem to indicate that the 
junior officer is most concerned with his level of success 
during his initial military contract agreement. Yet 
another, probably more important factor, is the officer's 
realization that the Marine Corps is not what he expected. 
This happens quite early in a young officer's career and 
his performance and attitude drop off quickly relative to 



that of his peers, 
therefore performance. 


This difference 
is reflected in 


in outlook, 
the officer's 


and 

INDEX 


1 

1 

1 

1 

1 


TABLE VII 

VARIABLE SIGNIFICANCE, FIRST LIEUTENANTS 


1 

1 

1 

1 

1 


1 

1 

1 


Variable 


P - Value 


Chi - Square 


1 

1 

1 


\ 

1 

1 


INDEX 


• 


241.65 


1 

1 

1 


1 

1 

1 


UNEMP 


0 . 0000 


28.49 


1 

1 

1 


1 

1 

1 


EDSPEC 


0.0001 


16.14 


\ 

1 

1 


1 

1 

1 


MGS 


0.0310 


4.65 


1 

1 

1 


1 

1 

1 


PAY 


0.0423 


4.12 


1 

1 


score 


and a clear distinction is made 


between the 


two 



groups. Because only those officers with the higher scores 
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LIEUTENANT (GNDI WITH 3 YEARS 







STEPWISE LOGISTIC REGRESSION PROCEDURE 








DEPENDENT VARIABLE: 


OUT 






4108 


OBSERVATIONS 










31 79 


OUT = 0 










929 


OUT = 1 










1052 


CBSERVATIUNS 


DELETED DUE TO 


MISSING VALUES 




VARIAB LE 




MEAN 


MINIMUM 


MAX 1 MU M 


RANGE 


INDXN 




0.881927 


0.325ol 7 


1 


0.674383 


PAY 




-5.88715 


-7. 3 


-4.5 


2.6 


UNEMP 


0 


.0238633 


0.021 


0.0295 


0.0085 


EDSPEC 




1.15287 


1 


2 


1 


MOS 




1.3313 


1 


2 


1 



-2 LJ) LIKELIHUOD FO?l MODEL CONTAINING INTERCEPT ONLY= 4392.07 
STLP 0. THE rOLLOHlNG VARIABLES ARE ENTERED; 

INIERCLP INUXN PAY UNEMP EUSPEC MOS 



MODEL ;HI-SQUARE= 335.68 WITH 5 D.F. 
CCNVERiENCE OBTAINED IN 5 ITERATIONS. 
MAX ABSOLUTE DE R I VA I I V E = 0 .9 B 76D-0 7 . 
MODEL CHI-SQUARE= 317.76 WITH 5 D.F. 



ISCORc STAT.I P=*0.0 

R= 0.265. 
-2 LOG L= 4074.31. 
1-2 LOG L.R.I P*0.0 



FRACTION OF CJNCURJANT PAIRS OF PREDICTED PROBABILITIES AND RESPONSES :0.654 
RANK CORRELATION BETWEEN PREDICTED PROBABILITY AND RESPONSE :0.335 



VAR lABLE 

INTERCEPT 

INUXN 

PAY 

UNEMP 

EDSPEC 

MOS 



FINAL 

BETA 

10.05289643 

-9.20897329 
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STD. ERROR 

0 .86800D10 
0.59240968 
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Figure 4.2 First Lieutenants (Ground) 
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continue to maKe the Marine Corps their career, the 
distinction between those with the greater promotion 
potential and those who will fail selection or get out is 
much less later in their careers. Since the dependent 
variable is the probability of separating from active 
service, one would expect a negative relationship between 
the INDEX score and the dependent variable. This negative 
coefficient clearly indicates a strong negative correlation 
between a higher INDEX score and an officer's decision to 
leave active duty. 

Following INDEX is the UNEMPloyment variable. 
This would seem to indicate that many of these young 
officers initially made the military their choice of 
employment because of the high unemployment rates they faced 
upon graduation from college. Three years later, these same 
officers, faced with a similar decision, opt for the 
security of a military career, making t e high umemployment 
rate more profound. Supporting this view is the negative 
coefficient associated with the UNEMPloyment variable, 
which indicates that as the unemployment rate increases, 
the probability that an officer will leave active duty 
decreases . 

The educational background of this data group 
also proved to be highly significant. Again, since the 
dependent variable is the probability of separating from the 
Marine Corps and a higher education level would seem to 
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indicate increased employability, the expected sign of this 
coefficient would be positive. However, in this case the 
negative coefficient assigned to the EDSPEC variable 
indicates that those officers with the more technical, and 
therefore more saleable, degrees are remaining in the Marine 
Corps. 

The remaining variables, MOS and PAY, were less 
significant than the above variables but not enough to be 
deleted from the model. While MOS and PAY are factors in 
this model it is understandable why they are less 
significant. In the case of MOS, the junior officer is at a 
point in his career that the significance of a combat arms 
MOS has little effect on his outlook regarding career 
aspirations in the Marine Corps. If it has, the officer is 
still at a point in his career to allow him the flexiblity 
to change his MOS to one of the combat arms. Nevertheless, 
the positive coefficient associated with this variable 
indicates, as expected, that those officers with the combat 
arms MOS ' s are remaining in the Corps. The least 
significant of all of the variables was PAY. Interestingly 
enough, the positive coefficient associated with this 
variable indicates that as pay becomes less comparable with 
the private sector, attrition decreases. This apparent 
anomaly is unexplained. Officers in this data group, for 
the most part, are experiencing their first full-time job. 
The military offers these junior officers a comparable 
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salary to their peers in the civilian sector and therefore 
they perceive their pay to be adequate. 

c. Performance Index Interval Table 

Figure 4.3 shows ten equal sub-group intervals 
within the first lieutenant population group. These sub- 
groups show actual attrition rates for each index interval 
during the period FY77 through FY80. For this data group 
only one interval shows an "out" (1) percentage which is 
greater than the "in" (0) percentage. This is interval 1, 
or all those officers with a performance index less than or 
equal to .796879. At the other end of the scale these 
percentages change, showing that officers with the higher 
index have the greatest retention rates. This table clearly 
indicates that for this data group, the Marine Corps is 
retaining those officers with the higher performance index 
while those at the extreme bottom are attriting at a 
significant rate. 

3. Captains (Ground) 

a. Prediction Rates 

Like the previous data group, the prediction 
rate for captains in the ground community for FY81 was 
extremely high. Actual attrition for FY81 was 51 officers 
while LOGIST predicted 50 officers would attrit for a 98 
percent prediction rate. As with the previous data group, 
the model was also evaluated with FY82 data in order to 
confirm its validity. The FY82 results were far less 



54 



LIEUTE^^ANT ( 
TABLE OF 

INTERVAL OUT 

FREWUENCYl 
PERCENT 
ROW PCT 
COL PCT 



GNO) WITH 3 YEARS 
INTERVAL BY OUT 



INTERVAL I 



INTERVAL 2 



INTERVAL 3 



INTERVAL ^ 



INTERVAL 5 



INTERVAL 6 



INTERVAL 7 



INTERVAL 8 



INTERVAL 9 



INTERVAL 10 



TOTAL 



-+ l.O 



1 0 


1 


102 
4.4L 
47. 15 
5.71 


204 
4.95 
52. J5 
21.02 


306 

7.42 

60.46 

9.59 


141 

3.42 

31.54 

15.00 


332 1 
0.05 
76.15 
10.41 1 


\ 104 

2.52 
23.85 
1 11.12 


342 

0.29 

70.00 

10.72 

1 "* 361 

0. 75 
03.76 
1 1 1 « 32 

¥ i 

332 

8.05 

00.00 

10.41 


92 

2.23 

21.20 

9.04 

70 

1.70 

16.24 

7.49 

03 

2.01 

20.00 

0.00 


340 

0.24 

03.33 

L0.66 

f ■( 

324 
7.85 
03.51 
10. 16 


60 

1.65 

16.67 

7.27 

64 

1.55 

16.49 

6.04 


354 
8.50 
06. 76 
LL.LO 


54 1 
1.31 
13.24 
5.70 1 


1 317 

7.60 
05.22 
9.9^ 


55 

1.33 

14.70 

5.08 



3190 

77.33 



935 
22. b1 



TOTAL 



366 

9.36 



^47 

10 . 0 ^ 



^36 

10.57 



^34 

10.52 



^31 

10.45 



415 

L0.06 



400 

9.09 



300 

9.41 



400 

9.89 



372 

9.02 



4125 
LOO. 00 



Figure 4.3 First Lieutenants (Ground) 
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encouraging than those for FY81. Attrition during FY82 was 
43 officers while LOGIST forecasted that 32 officers would 
leave the Corps. This equated to a 74 percent prediction 
rate. Figure 4.4 shows the results of this analysis, 
b. Significance of Variables 

With this particular data group, only two 
variables remain in the model at the .05 significance level. 
As discussed below, these variables, INDEX and EDSPEC, play 
a significant role in an officer's decision to attrit from 
the Marine Corps. Here again, the negative coefficient 
associated with the INDEX variable indicates that officers 
with higher performance index scores are much more likely to 
remain on active duty. The remaining variables used (PAY, 
UNEMPloyment, and MOS) were deleted from the model because 
they proved to be insignificant at the .05 level. Table 
VIII shows the variables used to predict attrition in this 
group ranked from those proving to be the most significant 
to those having no impact on the model's ability to predict. 

The final parameter estimates along with the 
variables deleted from the model are shown in Figure 4.5. 
As in the previous data group, INDEX is again the most 
significant variable. The output from these two data 

groups would seem to indicate that an officer's performance 
level plays a major role in his decision to either leave or 
stay in the Marine Corps. The other variable in the model, 
EDSPEC, may remain a factor because these officers, with 
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P = 0.0 
R= 0.204. 

1021 .3C. 
P = 0.0 



FRACTION UF L3NCURDANT PAIRS UF PREOICTEJ PROBAS I L I T I fc S AND KESPUNSFS : 0 . 62 4 
RAliK CORRELATION BETWEEN PKCUICTEU PROBABILITY AND RESPONSE :0.296 



TEST FUR ALL VARIABLES NUT IN THE MJUEL 
lESlDUAL CHI-SOUARE- 6.32 WITH 3 L).F. P = 0.09 70 



HtSIUUAL CHl-5gUARE IS NUT SIGNIFICANT AT THE 0.0500 LEVEL. 

CAPTAINS (GNU) WITH 5 YEARS 
STEPWISE LOGISTIC REGRESSION PROCEDURE 

DEPENDENT VARIABLE: UUT 



FINAL parameter ESTIMATES 



VAR lABL E 



BETA 



STL. ERROR CHI-SOUAP.E P K 



INTERCEPT 5.383 3305 3 

INOXN -9.3C5P37CJ 

EDSPEC a. 072543 7'J 



I .63300552 
1 .82010263 
0 . 1791 3430 



10.30 0.0010 

26.1V 0.0000 -0.150 

23. 71 0.0000 0.142 



Figure 4.5 CapLains (Ground) 
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TABLE VIII 



VARIABLE SIGNIFICANCE, CAPTAINS (GND) 



1 Variable 

1 


P - Value 


Chi - Square | 

1 


1 

1 INDEX 

1 


0.0000 


26.14 1 

1 


1 

1 EDSPEC 


0.000C 


1 

23.71 1 

1 


1 

1 MOS 

1 


0.0720 


1 

6.27 1 

1 


1 

1 UNEMP 

1 


0.1020 


1 

3.04 1 

1 


1 

1 PAY 

1 


0.5458 


1 

0.36 1 

1 


five years of 


service, are very employable in the civilian 


market based 


on their educational 


background and job 


experience. 


Supporting this idea 


is the positive 


coefficient for the EDSPEC variable. 


Unlike the first 



lieutenant population group, this positive relationship 
indicates that those officers with the more technical 
educations have a higher probability of leaving the service. 

A number of variables were deleted from the 
model. MOS and PAY may have proved insignificant for the 
same reasons they were less than totally significant in the 
previous data group. The deletion of the UNEMPloyment 
variable may explain the reason EDSPEC becomes significant 
to the model. A technical education, along with five years 
of experience as a military manager, may make civilian 
employment more available for this group than the national 
unemployment rate indicates. 
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c. Per jrmance Index Interval Table 

Th. table again shows the highest attrition 
rate occuring at interval 1, or those officers having a 
performance index less than or equal to .843051. Indicated 
is that out of 111 officers in this sub-group, 40 have left 
the service for a 36 percent attrition rate. The remaining 
intervals are generally within the same range with interval 
2 showing an "in" rate of 87.5 percent with a 12.5 percent 
"out" rate and interval 10 having an 88 percent "in" rate 
with an 11.9 percent "out" rate. This table clearly shows 
that those officers with the lowest performance index, 
interval 1, attrit at the greatest rate. Figure 4.6 shows 
the interval table with corresponding attrition rates. 

4 . Captains (Air) 

a. Prediction Rates 

Unlike the previous data groups analyzed, LOGIST 
was unsuccessful in predicting attrition for both FY81 and 
FY82. Actual attrition for FY81 was 67 officers. The 
LOGIST sum predicted 29 officers would attrit for a 43 
percent prediction rate. For FY82 the actual attrition was 
26 officers while LOGIST forecasted 6 officers would attrit 
for a 23 percent prediction rate. Figure 4.7 depicts the 
predicted attrition rates provided by LOGIST. The reasons 
for the model's poor performance are discussed following 
Table IX. 
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INTbRVAL 

FKEQUEMCY 
PEKCfcNr 
KUW PCT 
LCL PCT 



CAPTAINS (GNO) WITH 5 YEARS 
TABLE OF INTERVAL BY OUT 
OUT 



INTERVAL 



♦.343051 

INTERVAL 2 



♦. 8621 86 

INTERVAL 3 



INTERVAL 4 



-♦.8^1161 



INTERVAL 5 



♦.900515 

INTERVAL 6 



INTERVAL 



•♦.911001 

7 



INTERVAL 8 



♦. 934047 

INTERVAL 9 



INTERVAL 10 



TOTAL 



1.0 



1 0 1 


! ^ ! 


5.18 
63.96 
1 5.97 

91 1 

6.64 
87.50 

7.65 1 

^ i 

115 1 
8. 39 
85.19 

9.66 1 


40 1 
2.92 
36.04 
22.10 1 

1 1 
0.95 
12.50 
1 7.18 1 

20 
1.46 
14.81 
1 11.05 


izV\ 

9.12 
88.65 
10.50 1 

98 
7. 15 

88.29 
8.24 

♦ i 

132 1 
9.63 

94.29 
11.09 1 


1 

1.17 
11.35 

1 8.34 

13 
0.95 
11. 71 

7.18 

1 8 
0.58 
5.71 
1 4.42 


117 

8.53 

91.41 

9.83 


11 

o.eo 

8.59 
6. 08 


140 I 
10.21 
88.05 
11.76 I 


1 19 

1.39 
11.95 
1 10.50 


154 1 
11.23 
88.00 
12.94 1 


21 
1.53 
12.00 
1 11.60 


147’| 
10.72 
88.02 
12.35 1 


1 20 1 
1.46 
11.98 



1190 

86.80 



181 

13.20 



TOTAL 

111 

8.10 



104 

7.59 



135 

9.85 



141 

10.28 



111 

8.10 



140 

10.21 



128 

9.34 



159 

11.60 



175 

12.76 



167 

12.18 



1371 

100.00 



Figure 4.6 Captains (Ground) 
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b. Significance of Variables 

In the case of captains (Air), all variables 
remain in the model with INDEX again proving the most 
significant. The variables are shown in Table IX with the 
P-value and the associated Chi Squares. 





TABLE IX 




VARIABLE 


SIGNIFICANCE, 


CAPTAINS (AIR) 


Variable 


P - Value 


Chi - Square 


INDEX 


0.0000 


26.14 


PAY 


0.0016 


10.01 


UNEMP 


0.0054 


7.74 


EDSPEC 


0.0315 


4.62 



Figure 4.8 shows the variables and pertinent 
statistics provided through the LOGIST procedure. While the 
LOGIST model proved unsuccessful in predicting attrition for 
this group, an analysis of the variables may give some 
insight as to its failure. Like the ground community, the 
aviation element places a high level of significance on 
performance. However, PAY becomes more significant within 
this population group. This is reflected by the negative 
coefficient associated with the PAY variable which shows a 
direct connection between higher military pay and retention. 
This may explain the need of the Marine Corps to pay bonus 
dollars in order to retain these officers. The UNEMPloyment 
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CAPTAINS (AIR) WITH 5 YEARS 
STfcPWISE LOGISTIC REGRESSION PROCEDURE 

DEPENDENT VARIABLE: OUT 

793 OBSERVATIONS 
503 OUT = 0 

210 OUT = I 

2^^ OBSERVATIONS DELETED DUE TO MISSING VALUES 



VARIABLE 

INDXN 

PAY 

UNEMP 

EUSPEC 



MEAN 

0.B92J73 

-5.95183 

0.0269552 

1.23221 



MIN IMUM 

0.694634 
-7. 3 
0.0245 
1 



MAXIMUM 

0.993156 

-4.5 

0.031 

2 



RANGE 

0.298522 

2.8 

0.0065 

1 



-2 LOS LIKELIHOOD FOR MODEL CONTAINING INTERCEPT ONLY* 916.76 
TfcP 0. THE FOLLOWING VARIABLES ARE ENTERED! 

NIERCEP IVDX^ PAY UNEMP EDSPEC 



MODEL CHI-SgUARE= 59.30 WITH 4 D.F. 
CCN^ERiENCE OBTAINED IN 5 ITERATIONS. 
MAX AdSCLUTE DERIVATIVE=0.355I0-0B. 
MOOLL CHI-SgUARE* 59.64 WITH 4 D.F. 



(SCORE STAT.) P=0.0 

R= 0.237. 
-2 LOG L* 85 7. 12. 
(-2 LUG L.R.) P*0.0 



RACTIUN OF C3NC0RJANT PAIRS OF PREDICTED PROBABILITIES AND RESPONSES 
AA4K CORRELATION BETWEEN PREDICTED PROBABILITY AND RESPONSE 



:0.65I 

:0.325 



VARIABLE 

INTERCEPT 

INDXN 

PAY 

UNEMP 

LOSPEC 



FINAL PARAMETER ESTIMATES 
BETA STD. ERROR CHI-SUUARE 



1.71863564 
-12. 750J36B2 
-0.49904236 
188.2B1I6708 
0.404B1351 



3.385816B7 

2.06195064 

0.15774012 

67.6T09332B 

0.18824171 



0.26 

38.24 

10.01 

7.74 

4.62 



0.6117 

0.0000 

0.0016 

0.0054 

0.0315 



-0.199 

-0.093 

0.079 

0.054 



TIUE 



CLASSIFICATION TABLE 
PREDICTED 





NEGATIVE 


POSITIVE 


NEGATIVE 


572 [ 


11 


POSITIVE 


1B4 1 


26 


TOTAL 


756 T 


37 



TOTAL 

583 

210 

793 



sensitivity: 12.4? SPECIFICITY: 98.1? CORRECT: 75.4? 
FALSE POSITIVE RATE: 29.7? FALSE NEGATIVE RATE: 24.3? 



kmCIIUN of CJNCORUANT pairs of predicted probabilities and RESPONSES 
Af.K CORRELATION BETWEEN PREDICTED PROBABILITY AND RESPONSE 



:0.651 

:0.325 



Figure 4.8 Captains (Air) 
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variable remains significant while the variable closest to 
.05 is EDSPEC. Although EDSPEC remains in the model, it has 
the least significance compared to the captain (Ground) 
model. This may be the result of all of the officers in 
this group having a technical education. One interesting 
observation is that in this population group, all variables 
remain in the model. Similar results were achieved within 
the first lieutenant model. Both models show officers 
making a decision on whether to remain in the service after 
their initial contract obligation. 

c. Performance Index Interval Table 

Figure 4.9 shows the interval table with the 
corresponding percentages. This interval table, like the 
ones that precede it, indicates that attrition is greatest 
in the first interval. Interval 1, depicting those officers 
with an INDEX score less than or equal to .840127, shows a 
53 percent attrition rate in this population sub-group. 
While attrition is greatest at interval 1, the next three 
intervals also show a significant attrition rate when 
compared with the remaining intervals in the table. The 

combined attrition rate for these three intervals is 31 
percent. Again, this table shows attrition is greatest for 
those with the lower performance index scores. 

5. Majors (Air and Ground) 

Although the LOGIST procedure was utilized to analyze 
these two data groups, the attrition rates were too small to 
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CAPTAINS lAIR) WITH 5 YEARS 

TABLE OF INTERVAL BY OUT 

INTERVAL OUT 

FREQUENCY 
PERCENT 
ROW PCT 
COL PCT 

♦ 0 

INTERVAL I 



INTERVAL 2 



INTERVAL 3 



INTERVAL 4 



INTERVAL 5 



INTERVAL 6 



INTERVAL 7 



INTERVAL 6 



INTERVAL 9 



INTERVAL 10 



TOTAL 



1 0 1 


I_1 [ 


43 1 


1 50 1 


5.40 


6.27 


46.24 


53.76 


7.35 ! 


1 23.58 1 


55 


^ 24 1 


6.90 


3.01 


69.62 


30.38 


9.40 


11.32 1 


56 1 


i 24 


7.03 


3.01 


70.00 


30.00 


9.57 1 


11.32 


f 




48 


25 


6.02 


3.14 


65.75 


34.25 


8.21 


11.79 1 


1 1 


1 ^3 


7.65 


1.63 


82.43 


17.57 


I 10.43 1 


1 6.13 






57 


10 1 


7.15 


1.25 


85.07 


14.93 


9.74 


4.72 1 


1 


zvy] 


7.65 


2.63 


74.39 


25.61 


1 10.43 

f i 


9.91 1 


54 


14 1 


6.78 


1.76 


79.41 


20.59 


1 9.23 


6.60 1 


76"| 


1 


9.54 


2.13 


81.72 


18.28 


12.99 1 


1 8.02 






1 


14 1 


9.28 


1.76 


84.09 


15.91 


I 12.65 


6.60 1 


^ 585* 


212 


73.40 


26.60 



l.O ♦ "T 



TOTAL 



93 

11.67 



79 

9.91 



80 

10.04 



73 

9.16 



74 

9.28 



67 

8.41 



82 

10.29 



66 

8.53 



93 

11.67 



88 

11.04 



797 

1 00.00 



Figure 4,9 Captains (Air) 
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provide a worthwhile analysis. Actual attrition for FY81 
for both the air and ground communities was four officers! 
During FY82 a total of five officers within these data 
groups left the Marine Corps. A review of majors mith 11 
years of service showed similar attrition rates from FY77 
through FY83. With this level of attrition, the authors 
believed a worthwhile analysis of the data was impossible 
and discontinued any further study of this group. 

6 , Lieutenant Colonels (Ground) 
a. Prediction Rates 

The LOGIST procedure provided poor FY81 and FY82 
prediction rates for this data group. For FY81, actual 
attrition was 34 Marine officers while LOGIST predicted 45 
officers would leave the Marine Corps, or a 76 percent 
prediction rate. The results for FY82 were even less 
promising. Actual attrition was 15 officers compared to a 
LOGIST prediction of 44, or a 34 percent prediction rate. 
Figure 4,10 shows the results of this model for both FY81 
and FY82. 



b. Significance of Variables 

Figure 4.11 shows the final parameter estimates 
for this model. This data group retained two variables in 
the construction of the model. As with all of the other 
data groups studied, INDEX was the most significant. The 
other variable retained in the model was MOS, continuing the 
trend (discussed below) that in the ground community at 
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LT cuts (JNUI WITH 20 YEARS 
STEPWISE LUGISTIC REoRESSlCJN PRUCEUUkE 



DhPENOENT VARIA6LE: OUT 



CBSERVATIONS 
46J OUT = 0 
135 OUT = I 

157 QBSEKVATIONS OELETEi) DUE TO MISSING VALUES 



VARIABLE 

INOXN 

PAY 

UrJEMP 

EUSPEC 

MOS 



MEAN 

0.86A7A9 
-5, 77731 
0.02<f'V io9 
1. 16 321 
1 . 60^94 



MINIMUM 

0*789441 

-7.3 

0.021 

I 

I 



MAXIMUM 

0.983742 

-4.5 

0.0295 

2 

2 



RANGE 

0. 194301 
2.8 
0.0085 
I 
I 



-2 LO; LIKELIHOOD PUR MUOEL CONIAINING INTERCEPT ONLY- 

STEP 0. THE FOLLOWING VARIABLES ARE ENTERED: 

INTEKCEP INUX>J PAY UNEMP EOSPEC MOS 

MUULL CHI-SQUAKE= 73.46 WITH 5 O.P. I SCORE STAT.I 

CUNVERiENLE OBTAINEO IN 5 ITERATIONS. 

MAX ABSOLUTE DEK I VA TIVE= 0. 79L1U-06 . -2 LOG L= 

MODEL :HI-SJUAkE= 78.36 WITH 5 O.f. |-2 LUG L.R.I 



775.10 



P=0.0 
K — 0.2 9 7. 

696. 74. 
P=0.0 



FRACTION OF C SNCORDANT PAIRS OF PREDICTED PRUBAB I L I T I c S ANO RESPONSES :0.707 
RANK CURRCLATIQN BETWEEN PREDICTED PROBABILITY ANO RESPONSE :0.430 



FAST BACKWARD ELIMINATION 



VARIABLE 

DELETED CHI-SCUAKE 



RESIDUAL 

P CHI-SDJARc D.F. 



P 



tCSPEC 

PAY 

UNEMP 



0.03 

3.05 



3.8h 



0.8674 0.03 
0.0803 3.08 
0.0500 6.92 



1 0.8674 

2 0.2148 

3 0.0746 



S TE P I . 



MODEL SHI-S.UAKE= 67.37 WITH 2 D.F. 
CLNVERiENLE UBTAINEO IN 5 ITERATIONS. 
MAX ADSCLUTE Dt R I V A T I V £ = 0. 70 75D-0 7 . 
model CHl-SaUARE= 71.29 WITH 2 D.F. 



ISCURE STAT.I P=0.0 

K= 0.295. 
-2 LUG L= 703.81. 
(-2 LUG L.R.) P^O.U 



FRACTION a CJNCUKDANT PAIRS OF PREOICTtD PRO B ABI L I I I £ S AND RESPUNSES :0.698 
RAfJK CORRELATION BETWEEN PREDICTED PROBACILITY AND RESPONSE :0.413 



TEST FOR ALL VARIABLES NOT IN THE MUDLL 
RESIDUAL CHI-SQJARE= 6.99 WITH 3 D.F. P=0.0723 



RESIDUAL CHI-SCUARE IS NOT SIGNIFILANT AT THE 0.0500 LEVEL. 

LT CUES ICNDI WITH 20 YEARS 



DEPENDENT VARIABLE: CUT 



FINAL PARAMETER ESTIMATES 



VAR I A6L E 



BETA 



STD. Error lhi-souare p r 



INTERCEPT -I 7. '^2502086 
INDXN 20. 2 7835291 

MOS -0.97168114 



2.52255510 
2.8V204932 
0.19 163512 



47. 72 O.OUOO 
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least, having a combat arms MOS is perceived to be important 
to an officer's future in the Marine Corps. Variables 
deleted from the model were UNEMP, PAY and EDSPEC. Table X 
shows the variables for this data group. 

While INDEX remained the most significant 
variable throughout this study, the positive coefficient 
assigned to INDEX shows a distinct break from the preceding 



VARIABLE 


TABLE X 

SIGNIFICANCE, LIEUTENANT 


COLONELS (GND) 


Variable 


P - Value 


Chi - Squar 


INDEX 


0.0000 


47.72 


MOS 


0.0000 


25.71 


UNEMP 


0.0500 


3.84 


PAY 


0.0808 


3.05 


EDSPEC 


0.8678 


0.03 



population groups. This indicates 
this study, that those officers wi 
index scores have a greater probabi 
output supports the data displayed 
Interval Table discussed below, 
group placed a high level of sign 
would seem to indicate that afte 
lieutenant colonel, officers beg 
assignments as a means of enhancing 



, for the first time in 
th the higher performance 
lity of attrition. This 
in the Performance Index 
Additionally this data 
ificance on MOS. This 
r reaching the rank of 
in to regard their 30 b 
their future chances for 
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promotion. Marine officers in this data group with combat 
arms MOS's have had more command assignments, making their 
future promotion potential greater than those who have never 
served in this type of job due to their restrictive MOS's. 

Addressing the variables deleted, EDSPEC seems 
to become less significant as the officer moves further away 
from his undergraduate years. This supports the authors' 
original impressions about this variable, believing it would 
prove significant only for the younger officers. PAY was 
also deleted. This is because these officers have spent 
twenty years in the military, and any real concerns with pay 
would have forced them to attrit at an earlier stage in 
their careers. UNEMPloyment is just insignificant at a P- 
value of .05, which may indicate that the civilian 
positions this group are seeking are not related to the 
national unemployment rate. 

c. Performance Index Interval Table 

Figure 4.12 shows the interval table with the 
corresponding attrition percentages. With this data group, 
the performance INDEX does a complete reversal from the 
previous data groups. In the previous populations, 
attrition was highest among the intervals representing the 
lowest index scores. A percentile breakdown of the 
performance INDEX scores of this data group reveals that 
those officers with the highest scores attrit at the 
greatest rate. Interval 1, or those officers with a score 



71 



INTERVAL 

FREgUENCYl 
PERCENT 
ROW PCT 
CUL PCT I 

INTERVAL I 



INTERVAL 2 



INTERVAL 3 



LT CULS IGNO) with 20 YEARS 
TAbLE UF INTERVAL BY OUT 
OUT 



INTERVAL A 



-♦.866850 



87J68A 



INTERVAL 5 



INTERVAL 6 



♦.892‘fSA 

INTERVAL 7 



♦.905655 

INTERVAL 8 



♦.916338 

INTERVAL 9 



INTERVAL 



TOTAL 



-♦. 929695 
10 



L.O 



1 0 1 
^ 

54 

8.33 

83.08 

11.66 

1 55 

8.49 
84.62 
1 11.88 


l_l j 

11 

1.70 

16.92 

5.95 

10 

1.54 

15. 

5.41 


55 

8.49 
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11.83 


10 

1.54 

15.38 

5.41 


1 69 j 
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1 14.90 I 


1 15 1 

2.31 
17.66 

1 8.11 j 


33 
5.09 
70.21 
7. 13 


14 

2.16 

29.79 

7.57 


44 1 
6. 79 
67.69 
9.50 1 


1 21 
3.24 
32.31 
1 11.35 


43 

7.41 
73.85 
10.37 

39 

6.02 

60.94 

8.42 


17 

2.62 

26.15 

9.19 

25 

3.06 

39.06 

13.51 


36 1 
5.56 
55.38 
7.78 1 


1 29 

4.48 
44,62 
1 15.68 

► 

1 

5.09 
52.38 
1 17.84 


30 
4.63 
47.62 
6.43 1 


463 

71.45 


185 

28.55 



TOTAL 



65 

10.03 
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10.03 



65 

10.03 



84 

12.96 



47 

7.25 



65 

10.03 



65 

10.03 



64 

9.88 
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10.03 



63 
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less than or equal to .838258, have an "in" percentage of 33 
percent and an "out" percentage of 16.9. On the other hand, 
interval 10 has an "out" rate of 52 percent and an "in" rate 
of 47.6 percent. A review of this table clearly shows that 
those officers in the higher interval subgroupings attrited 
at the greatest rate. This would indicate the Marine Corps 
is losing those officers with the higher performance INDEX 
scores from this data group. 

7 . Lieutenant Colonel (Air) 



a. Prediction Rates 

The application of LOGIST to this population 



group produced 


mixed 


results 


for FY81 and FY82. 


The 


prediction 


rate 


for 


FY81 was 


93 percent, with 


LOGIST 


forecasting 


14 


officers against 


an actual attrition 


of 13 


officers . 


For 


FY82, 


LOGIST 


predicted 13 officers 


would 



leave the Marine Corps when actually 16 officers resigned 
their commissions. This resulted in a predictability rate 
of only 81 percent for FY82. Figure 4.13 displays the 
LOGIST output for this data group. 

b. Significance of Variables 

INDEX was the only variable that remained in 
this model. The input variables for this model were similar 
to those used to analyze the lieutenant colonel (Ground) 
population group with the exception of MOS. MOS was not 
used with this group because, like the other groups 
involving pilots, they had been arranged by MOS initially. 
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The variables evaluated in this model are shown below in 
Table XI. 

INDEX again emerges as the primary variable for 
determining attrition for this group and, like the 
lieutenant colonel (Ground) population group, the 
coefficient is positive. The other variables analyzed by 
the model were deleted for the same reasons given during the 
discussion of the previous data group. Figure 4.14 shows 



VARIABLE 


TABLE XI 

SIGNIFICANCE, LIEUTENANT 


COLONELS (AIR) 


Variable 


P - Value 


Chi - Squar 


INDEX 


0.0U01 


14.93 


UNEMP 


0.4113 


0.67 


PAY 


0.6854 


0.16 


EDSPEC 


0.9985 


0.00 



the final parameter estimates of this data group along with 
pertinent statistics on the variables deleted from the 
model . 

c. Performance Index Interval Table 

Figure 4.15 shows the table for this population 
group with attrition and retention percentages by interval. 
Although the total number of officers appearing in this 
table is much less than the lieutenant colonel ground table, 
239 versus 648, the results are similar. Interval 1, 
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LT COLS UIR) hITH 20 YEARS 
STEPWISE LOGISTIC REGRESSION PROCEDURE 



DEPENDENT VARIABLE: OUT 



23 9 C13 SERVATIONS 
laa OUT - 0 

51 uur = 1 

62 08SLRVATIUNS UELETkJ DUE TO MISSING VALUES 



VARIABLE 


MEAN 


MIN IflUM 


MAX I MUM 


RANoE 


INDXN 


0.091865 


0.796900 


0.97049 7 


0. 173A 99 


PAY 


-5.63 703 


-7.3 


-4.5 


2.8 


UNEMP 


0 .02 74142 


0.0245 


0.031 


0.0065 


EDSPEC 


1.27615 


I 


2 


1 



-2 LO; LIKELIHUCJD TOR MUOEL CONTAINING 
STEP 0. THE FOLLOWING VARIA3LES ARE ENTERED: 
INTERCEP 1NDX9 PAY UNEMP ED3PEC 

MODEL :HI-SUUAKE= 16.94 WITH # D.E. 

convek;en^e ubtainld in 5 iterations. 

MAX AGSCLUTE L)E K 1 VAT I V E = 0 .o 8 200-0 / . 
MUUEL SHI-SUUAKE^ 17.63 WITH 4 D.F. 



INTERCEPT ONLYs 247.30 



ISCOKE STAT.) P=0.0020. 

0.197. 

-2 LOG L= 230.17. 
1-2 LUG L.R.) P=0.0015. 



FRACTION UF CJNCflROANT PAIRS UF PREDICTED PRO8A0 IL I T I ES AND RESPUNSES :0.68l 
RANK CURRELATION BETWEEN PREDICTED PRU6AJILITY AND RESPUNSE :0.382 



FAST BACKWARD ELIMINATIUN 



VARIABLE 






RESIDUAL 




DELETED 


CHI-SUUARE 


P 


CHI-SwUAKC 


D.F . 


EDSPEC 


O.DO 


0.9985 


0.00 


1 


UNEMP 


0.67 


0.^113 


0.67 


2 


PAY 

1 . 


0. 16 


0. 6354 


0.84 


3 



0.9935 

0.7136 

0.8401 



MOULL :HI-S0UARE= 16.03 WITH I D.F. 

CCNVERJENCE UUTAINEJ IN 5 ITERATIONS. 
MAX AOSCL'JTE UER I VAT I VE = 0.92 0OD-O J. 
MUDEL .hI-SgUAKE= lb. 81 WITH 1 D.F. 



ISCURE STaT.) 



-2 LOG L 
1-2 LOG L.K.) 



P^O.OOOl. 
K= 0.2‘t4. 

231.00. 

P=0.0000. 



FRACTION CF CONCORDANT PAIRS OF PREDICTED PKUO A3 1 L I T I E S AND RESPONSES :0.68b 
RA/TK correlation BETWEEN PkEDICTED PKUOAGILITY AND RESPUNSE :0.391 



TEST FUR ALL VARIABLES NUT IN THE NODLL 
IESIDUAL CHI-SUUARE- 0.84 WITH J D.F. P=0.3392 



RESIDUAL CHI-jUUARE IS NOT SIGNIFICANT AT THE 0.0500 LEVEL. 

LT COLS (AIR) WITH 20 YEARS 
STEPWISE LOGISTIC RtGRESSION PRUClDURE 

DEPENDENT VARIABLE: U J T 



FINAL PARAMETER ESTIMATES 



VAR I ABLE 



Bfc lA 



STD. ERROR CHI-SOUARE P R 



irjTERCEPT -20.50C546J6 
INDXN 21.3782201T 



5 .001 0599 7 
5.53319721 



16.30 0.0000 

14.93 0.0001 0.228 



Figure 4 . 14 
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LT CULS (A 
TABLE UE 

INTERVAL OUT 



FREQUENCY 
PERCENT 
ROW PCT 
COL PCT 



IR) WITH 20 YEARS 
INTERVAL BY OUT 



INTERVAL 1 



INTERVAL 2 



INTERVAL 3 



interval 4 



INTERVAL 5 



INTERVAL 6 



INTERVAL 7 



INTERVAL 8 



INTERVAL 9 



INTERVAL 10 



0 



TOTAL 



l.O ♦’ 



! ^ 1 


1 1 j 


21 
8. 79 
91.30 
11.17 


2 

0.84 
8. 70 
3.92 


19 1 

7.95 
82.61 
10.11 1 


‘ i^r 

17.39 
1 7.84 


22 
9.21 
91.67 
11. 70 

f ■{ 

20 
8.37 
90.91 
1 10.64 


2 

0.84 

8.33 

3.92 

0.84 
9.09 
3.92 1 


21 
8. 79 
87.50 
11.17 


3 1 
1.26 
12.50 
5.88 I 


1 

7.95 
79.17 
1 10.11 
f < 

23 

9.62 

92.00 

12.23 


5 

2.09 

20.83 

9.80 

2 

0.84 

8.00 

3.92 


18 

7.53 

72.00 

9.57 


z.,l j 

2 8 . 00 
13.73 1 


11 

4.60 

44.00 

5.85 


14 

5.86 

56.00 

27.45 


1 1 

5.86 
58.33 
1 7.45 1 

f ^ 


1 4. 18 

41.67 
1 19.61 



188 

78.66 



51 

21.34 



TOTAL 

23 

9.62 



23 

9.62 



24 

10.04 



22 

9.21 



24 

10.04 



24 

10.04 



25 

10.46 



25 

10.46 



25 

10.46 



24 

10.04 



239 

100.00 



Figure 4.15 Lieutenant Colonels (Air) 
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officers with a performance index score less than or equal 
to .847943, shows retention at 91 percent and attrition at 
8.7 percent. Intervals 9 and 10, or those officers with the 
highest performance index, show attrition rates of 56 and 41 
percent respectively. Again, contrary to the earlier groups 
analyzed, this population group shows that officers with the 
highest performance indexes are attriting at the greatest 
rate. 

8. Colonels (All) 

a. Prediction Rates 

With this last data group, there were no 
variables which passed the significance test. Table XII 
shows the P-value and Chi-Square statistics of the variables 
deleted from this model. Because these variables did not 
enter the model, LOGIST assigned the entire population group 
a uniform probability of leaving the service. This was due 
in large part to the relatively small size of the population 
group. The results of this procedure were discarded because 
they merely equated to a random chance probability of 
attrition. Accordingly, further analysis of the variables 
and prediction rates was not conducted for this group. 

b. Performance Index Interval Table 

The colonels displayed similar results when 
compared with the lieutenant colonel's ground and air data 
groups. Attrition is greatest at the highest performance 
INDEX interval. That interval, interval 10, shows an 
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TABLE XII 



VARIABLE SIGNIFICANCE, COLONELS (ALL) 



Variable 



P - Value 



Chi - Square 



PAY 



0.a855 



2.96 



INDEX 



0.2001 



1.64 



UNEMP 



0.4870 



0.48 



EDSPEC 



0.5257 



0.40 



attrition rate of 48 percent for those officers with a score 
greater than or equal to .933774. The remaining intervals 
show a substantial decrease in their attrition rates when 
compared with interval 10. Figure 4.16 shows the results of 
the interval tables by performance INDEX. 

F. SUMMARY 

This concludes the data analysis phase of the authors' 
study of Marine officer attrition. In the following 
chapter, these results will be consolidated for clarity and 
the authors will provide comments and recommendations for 
future studies in this area. 
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COLllNbLS (ALL) WITH 26 YtARS 
TAliLE 'JF INTLRVAL BY OUT 



INTERVAL OUT 

FREQUENCY 
PLkCEN r 
HtiW PCT 
CUL PCT 

^ 0 

INTERVAL 1 



INTERVAL 2 



INTERVAL 3 



INTERVAL ^ 



INTERVAL 5 



INTERVAL 6 



INTERVAL 7 



INTERVAL 0 



INTERVAL 1 



INTERVAL 10 



! ° ! 


! ^ ! 


27 

7.63 

77.14 

9.32 


3 

2.26 
22.86 
10. 13 


30 
8.4 7 
83.33 
10.91 


6 

1.69 

L6.67 

7.59 


27 

7.63 

77.14 

9.82 


8 

2.26 
22.86 
10. 13 


27 
7.63 
77. 14 
9.02 


8 

2.26 

22.06 

10.13 


26 
7.34 
72.22 
9.45 1 


i 

2.82 
27.73 
1 12.66 


23 

7.91 

80.00 

10.13 


7 

1.98 
2 0.00 
d. J6 


31 
8. 76 
86.11 
11.27 


5 

1.41 

13.89 

0.33 


39 
8.47 
0 5.71 
10. 9 1 


5 

1 .41 
14.29 
6.33 


\ 

0. 76 
i6.ll 
1 11.27 


5 

1.41 

13.3; 

6. 5 i 1 


i L ^ 

5.03 
51.43 
1 6.55 


17 
4.80 
4 3.57 
21 . 52 



TOTAL 



l.O >- 



2 75 
77*66 



7 ^ 

22.32 



TOTAL 



35 

9.89 



36 

10.17 



35 

9.89 



35 

9.89 



36 

10.17 



35 

^.8 9 



36 
10. L7 



35 

9.89 



36 

10.17 



35 

9 . 3 ^^ 



35A 

100.00 



Figure 4.16 Colonels (All) 
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V 



SUMMARY 



A. GENERAL 

This chapter will consolidate the results of the output 
discussed in the previous chapter and offer some comments 
and recommendations regarding further research in the area 
of predicting Marine officer attrition. Figure 5.1 depicts, 
by data group, the prediction sums developed by the SAS 
LOGIST function along with actual attrition for FY81 and 
FY82. These figures were transformed into a prediction rate 
which is also included in Figure 5.1. Additionally, 
variables which were retained in the model are listed in 
order of significance while those deleted are presented in a 
similiar format. 

B. PREDICTION RATES 

The overall results of this binary choice model, using 
the SAS LOGIST procedure, were not encouraging. For FY81, 
three of the five data groups analyzed gave a prediction 
rate over 90 percent. At first glance, an ability to 
predict attrition of Marine Officers at a 90 percent or 
better rate would seem to validate the models and thus 
provide a useful tool for manpower planning. But, in all 
three cases, when these same population groups were used to 
forecast FY82 attrition, the prediction rates dropped well 
below the 90 percent benchmark achieved for the FY81 data. 
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Figure 5.1 



Consolidated Results 
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Additionally, the rates obtained for the remaining data 
groups were far below the 90 percent level for both FY81 and 
FY82. Since the results of the tests conducted varied from 
highly successful to totally unacceptable, the logit 
procedure would appear to contain sufficient inaccuracy to 
preclude its use. 

This analysis however, has provided further insight into 
the difficult question of predicting officer attrition. 
This problem is very difficult to solve using statistical 



models. 


Surely, 


there 


are an 


umber of 


variables 


which 


cannot be 


eas i ly 


quan 


tified yet 


play a 


major role 


in an 


off icer 's 


decision 


to 


leave the 


service. 


Some of 


those 


variables. 


such as 


f ami 


ly situati 


on, next duty assignment. 


personal 


contacts 


in 


the civili 


an job 


market and 


self 



perception, would have to be individually modeled for this 
type of analysis to prove statistically relevant to the 
user. Through personal experiences, the authors have 
consistently heard the above reasons given as to why an 
officer has chosen to leave the service. This study seems 
to confirm the diverse nature of this decision. 

Future studies in this area should concentrate on 
evaluating more population groups with additional variables 
over a longer period of time to provide a wider range for 
analysis. Only in this manner can the investigator hope to 
achieve relevant results for this problem through the use of 
standard statistical techniques. 
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C. SIGNIFICANCE OF VARIABi-c,S 

As shown in Figure 5.1, INDEX was the most significant 
variable in each of the five population groups analyzed. 
These results confirmed the perceptions provided by Major 
Esmann during his initial research. Officers interviewed 
for that study felt that promotion potential was a major 
factor when determining whether to remain on active duty or 
leave the service. Further, they felt that a high promotion 
potential would indicate satisfaction with the service while 
a low promotion potential would cause an officer to resign 
his commision [Ref. 9]. While the performance INDEX proved 
significant with all of the population groups, it is of 
particular note that for the three most senior groups 
analyzed (lieutenant colonels. Air and Ground, and 
colonels), the variable had a positive coefficient. The 
inverse relationship of this positive coefficient means that 
as an individual's INDEX score increases at these higher 
ranks, so does the probability that he will leave the Marine 
Corps. Additionally there is the question of whether the 
performance INDEX equates to promotion potential. One major 
shortcoming of this variable in relation to promotion 
potential was that it failed to reflect an officer's level 
of experience or his past job assignments. The assignment 
variable in particular is one of the most significant 
factors considered by promotion boards and is continually 
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stressed throughout the Marine Corps as a vital ingredient 
when planning the "ideal” career. In essence, those Marine 
officers who have been assigned the "better jobs", may in 
fact have lower INDEX scores because of the competitive 
nature of those billets. Nevertheless, these officers will 
still have a greater promotion potential than their 
contemporaries who received a higher INDEX score from a less 
career-enhancing billet. Since the INDEX score has such a 
narrow range, job assignment may in fact be the vital 
ingredient for determining promotion potential. A review of 
the GAO study on quality retention of Marine officers shows 
that a job assignment fator was numerically transformed and 
combined with a performance indicator in order to provide a 
more realistic presentation of promotion potential [Ref. 
10]. Future studies in this area should include a job 
assignment factor to weight the performance INDEX when 
evaluating the effect of promotion potential on attrition. 

While the Esmann study concluded that the unemployment 
variable (UNEMP) was significant in predicting attrition, the 
SAS LOGIST procedure provided quite the opposite results. 
In three of the five data groups analyzed, the unemployment 
variable did not meet the .05 significance level and was 
deleted from the model. In those cases where it remained, 
the population groups were completing their initial 
contracts and reviewing their employment opportunities in 
the civilian community. These groups, first lieutenant 
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(Ground) and captain (Air), were facing similiar decisions 
and the unemployment rate appears to have been an important 
factor in their decision to leave the Marine Corps. For the 
remaining three data groups, the effects of the employment 
situation were of little or no significance. 

During the initial phases of this study it was felt that 
the educational specialty (EDSPEC) variable would be 
significant only to the more junior population groups whose 
recently acquired skills remained a marketable asset. This 
assumption proved correct, with the first lieutenant 
through captain data groups retaining EDSPEC and the 
lieutenant colonel data groups deleting this variable. This 
would seem to indicate that a college major is more 
important to a young officer entering the civilian market 
and less important to the career officer who must sell his 
managerial background rather than what he studied in college 
twenty years ago. 

The MOS variable was input as a independent variable in 
only three of the five population groups analyzed. Of those 
three groups, it was retained by only two as being 
significant, first lieutenants and lieutenant colonels 
(Ground). The retention of this variable and its degree of 
significance in the lieutenant colonels (Ground) model 
should be noted. It may be that MOS is directly correlated 
to the job assignment factor previously discussed. It would 
appear that those officers with combat arms MOS ' s (0302 and 
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0802 ) are in a better position to serve in those assignments 
considered career enhancing while those officers without 
these MOS ' s would not have the opportunity to assume the 
limited number of command billets. While an individual's 
past assignments were not introduced into this study, the 
presence of MOS in this particular model may indicate its 
relationship with promotion potential and therefore 
attrition. 

The last independent variable to be included in this 
study was pay compatibility with the civilian sector. For 
the most part this variable proved insignificant, being 
retained in only two of the eight data groups analyzed. 
However, in the case of the captain (Air), PAY was second to 
INDEX in its level of significance to the model. Although 
not a principle factor in this population group's decision 
to leave the Marine Corps, the appearance of PAY in the 
captain (Air) model still suggests that money is important 
to this group and may explain the payment of bonuses to 
retain the younger naval aviator. 

To summarize, the viability of the performance INDEX 
score as a predictor of promotion potential was not 
determined during this study. However, its value in 
predicting attrition was confirmed. The same can not be 
stated of the other independent variables which nevertheless 
provided some interesting insights into the various elements 
affecting an officer's decision to leave the Marine Corps. 
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D. PERFORMANCE INDEX INTERVALS 

The performance index interval tables were originally 
constructed to assist the authors in determining at what 
INDEX level the greatest attrition was occuring for each 
population group. Using this information in conjunction 
with the SAS LOGIST output, the authors would be able to 
ascertain at which intervals SAS was having the most success 
in predicting attrition. From the results provided, the 
indication is that LOGIST predicts quite well at the extreme 
ends of the tables while the prediction capabilities in the 
middle intervals are far less accurate. 

Through this process the authors discovered that the 
senior officers, lieutenant colonels - year 20, and colonels 
- year 26, with the highest performance index scores were 
attriting at the greatest rate. Table XII shows the 
retention and attrition percentages at the interval 
extremes for each of the population groups analyzed. As 
this table indicates, the percentage of officers staying in 
or leaving the Marine Corps compared to their performance 
index interval is completely reversed between the junior and 
senior officers. If a high performance index equates to 
being a good officer, then the significance of tnis 
information would be that the Marine Corps is losing its 
best officers when they reach the senior ranks. Of course, 
as discussed previously, the performance index may not 
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TABLE XIII 



CONSOLIDATED PERFORMANCE INDEX INTERVALS 



i 

1 Population 




Index 


Percentage 


1 

1 

1 


1 Group Interval 

1 


Value 


In Out 


1 

1 


1 

1 1st Lt (GND) 


1 


<.796879 


47% 52% 


1 

1 


1 

I 


10 


>.955157 


85% 14% 


1 

1 


1 

1 Capt's (GND) 


1 


<.843051 


83% 16% 


1 

1 


1 

1 


10 


>.950598 


89% 10% 


1 

1 


1 

1 Capt's (AIR) 


1 


<.840127 


59% 40% 


1 

1 


1 

1 


10 


>.944003 


91% 8% 


1 

1 


1 

1 Lt Col’s (GND) 


1 


<.838258 


83% 16% 


1 

1 


1 

1 


10 


>.929695 


47% 52% 


1 

1 


1 

1 Lt Col’s (AIR) 


1 


<.847943 


91% 8% 


1 

1 


1 

1 


10 


>.932495 


58% 41% 


1 

1 


1 

1 Col’s (ALL) 


1 


<.857835 


77% 22% 


1 

1 


1 

1 


10 


>.933774 


51% 48% 


1 

1 


indicate the total professional abili 


ty of Marine 


Corps 


senior officers 


because 


it fails 


to consider 


past 


assignments when evaluating 


a Marine’s 


worth to the service. 


Nevertheless, an 


officer 


with twenty 


years of service has 


had a variety of 


assignments with 


different reporting 


seniors. If he has been 


able to maintain an above 


average 


performance index score throughout that 


period, it should be 


a true indicator 


of his 


greater ability and increased 


promotion potential 


• 








The GAO report 


used a job experience factor along 


with a 


performance indicator which 


showed that 


the Marine Corps was 
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retaining its better senior officers. The opposing results 
achieved by this study and the GAO report indicates that 
different methods of evaluating promotion potential will 
yield varying results. Any future studies in this area 
should begin by identifying the value of the performance 
index to determine if it is a true indicator of an officer's 
performance and, therefore, promotion potential. 

E. CONCLUSION 

The overall success rate of the logit function to pre- 
dict Marine officer attrition was less than the authors had 
expected. A possible explanation may lie with the variables 
selected to describe the individuals in the population 
groups. The performance INDEX score is clearly a starting 
point for determining the promotion potential of an officer, 
but his past assignments cannot be overlooked as a signifi- 
cant factor in this regard. While the remaining variables 
provide some interesting insights into the differences be- 
tween the population groups, their usefulness in predicting 
attrition was generally minimal. In retrospect, the 
complexity of this decision, to remain on active duty or 
leave the Corps, may make it impossible to accurately model 
using currently available data. Only with the inclusion of 
a number of additional, difficult to obtain or measure 
variables, can an analyst hope to achieve accurate results 
across all population groups and years of service. 
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